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@ While some of the characteristics of medical pathologies or industrial defects in images are subject to 
classification, many aspects of these abnormalities do not lend themselves to precise programmable char- 
acterizations. Neural networks have been useful in recognizing patterns in a number of applications involving 
multiple variables whose precise interactions are not well-understood or quantifiable. In order to aid the medical 
imaging personnel, industrial troubleshooters and others in locating potential . anomalies in an image, this 
invention employs neural network analysis to help identify regions of suspected anomalies. 
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Cross Reference to Related Applications This application is a continuation-in-part of application serial 
number 07/664,906, filed March 5, 1991. 

BACKGROUND OF THE INVENTION 

5 

Field of the Invention The present invention relates to the computer-aided analysis of digitized images 
and, in particular, to the location of possible anomalies in medical diagnostic and/or industrial images. 

Description of the Related Art It has become an established and wide-spread medical practice to 
perform mammograms to determine the possible presence of breast cancer or other pathological con- 
io ditions. The amount of time spent by health care professionals in scanning a large number of screening 
exam images tends to be very short and statistics indicate that diagnostic accuracy may suffer. Also, some 
features of these radiological images are such that their significance can only be recognized by specialized 
experts. 

While diagnostic uncertainty may not exist in those instances in which a tumor or cancerous tissue is 
75 well developed and, thus, clearly visible to all, it will be appreciated that in the very early stages of the 
development of such tissue abnormalities, a reliable diagnosis based upon a brief examination of radiolog- 
ical images is much more difficult. Yet early diagnosis is a very important factor in arresting and effectively 
treating cancerous tissues and tissues which exhibit conditions precursory to cancer, particularly breast 
cancer. 

20 In addition to the benefits of early determination of pathology in medical diagnostic images, such as 
mammograms, many types of defects in industrial images must also be detected during product manufac- 
ture to prevent catastrophic failure, e.g. in airplane turbine blades. Many types of anomalies in industrial 
materials have been identified and classified by computer analysis of an X-ray image. For example, in some 
cases, metal cracks or incomplete welds have precisely identifiable characteristics which may be pro- 

25 grammed into a standard algorithmic computer for automated analysis. However, in most cases they 
cannot. Still other types of defects, such as corrosion pits and delami nations in honeycombed or layered 
materials, are even more difficult to characterize due to their variety of manifestations. 

Both the medical diagnostic images and the latter variety of industrial x-ray images are not easily and 
simply classifiable and do not lend themselves to precise programmable characterizations, such as an 

30 expert system. Conventional algorithmic computers excel in applications where the knowledge can be 
readily represented by means of an explicit set of rules, e.g. a decision tree. Where this is not the case, 
conventional computers encounter difficulty. While conventional computers can execute an algorithm much 
more rapidly than any human, they are challenged to match human performance in nonalgorithmic tasks 
such as visual pattern recognition. 

35 However, parallel distributed processing networks, also known as "neural networks", have been shown 
to be useful in recognizing patterns in a number of applications involving multiple variables whose precise 
interactions are not well-understood or quantifiable. 

In contrast to algorithmic computer systems (including so-called "expert systems"), a neural net 
computing system is not formulated to exhibit an explicit algorithm or a set of explicit rules. Instead, a 

40 neural network is "trained" to recognize patterns in input data by an iterative adjustment of the connection 
weights associated with each processing element. These connection weights are adjusted to minimize a 
preselected output error function. This adjustment of the weights may be accomplished by various well- 
known techniques. 

In traditional image analysis (and/or processing), background subtraction and image normalization have 
45 been used to obtain better image resolution with reduced noise. However, when at least some of the areas 
of interest in the data are sufficiently close to the noise, removal of background from the entire image and 
subsequent normalization may not satisfactorily distinguish such data from the overall noise. There is thus a 
need in image analysis for an improvement in the separation of various areas of interest from the effects of 
overall noise in the image. 

50 

SUMMARY OF THE INVENTION 

The present invention employs analysis by a parallel distributed processing system to accurately 
provide information about the location of possible anomalies in medical diagnostic and industrial images. 
55 The invention is more particularly described as a method for computer-aided detection of possible 
anomalies in a digitized image comprising a plurality of M x N picture elements each representing an 
optical density in the digitized image, the method comprising the steps of: 
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a) subdividing the digitized image into a plurality of predetermined regions each comprising m x n 
picture elements, where m < M and n < N; 

b) subtracting background from each predetermined region of the digitized image; 

c) selecting a subregion comprising pxq picture elements, where p < m and q < n; 
5 d) normalizing the image data from the pxq subregion; 

e) using a neural network system, analyzing each predetermined subregion of the digitized image to 
recognize any pattern indicative of an occurrence of a possible anomaly, 

the neural network system comprising at least two member neural networks each trained to 
recognize a particular preselected anomaly type of a preselected size and to produce an output signal 
70 value indicative of the presence of said preselected anomaly type; 

f) comparing the output values of each of the member neural networks to a first predetermined threshold 
value above which the presence of a possible anomaly is indicated; 

g) comparing the output values that exceed the first threshold(s) to select the maximum signal value; 

h) comparing the maximum signal value to a second predetermined threshold value above which the 
75 presence of a possible anomaly is indicated; 

i) based upon these comparisons, determining the location within the digitized image of each possible 
anomaly; 

j) using a clustering analysis, identifying each vicinity on the digitized image at which a cluster of 
locations of possible anomalies occurs; and, 
20 k) marking the location on an original image corresponding to the digitized locations of the clusters. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will be more fully understood from the following detailed description thereof, taken in 
25 connection with the accompanying drawings, which form a part of this application and in which: 

Figure 1 represents a block diagram of the overall process in accordance with the present invention; 
Figure 2 is a more detailed block diagram of the analysis executed by a digital computer operating in 
accordance with a program in accordance with the present invention; 

Figure 3A is a plot of optical density values versus distance along a line Z-Z in the digitized image of 
30 Figure 2, while Figure 3B is a representation of the plot of Figure 3A following background subtraction. 

DETAILED DESCRIPTION OF THE INVENTION 

Throughout the following detailed description, similar reference numerals refer to similar elements in all 

35 Figures of the drawings. 

The present invention, illustrated by an overall block in Figure 1 , relates to a method of computer-aided 
analysis of an analog image 2 to determine the presence of anomalies therein. The anomalies are indicted 
by the reference character A. Although the analog image 2 is indicated as an x-ray film containing a 
medical diagnostic image such as a mammogram, it should be understood that the invention may be used 

40 with any form of medical diagnostic image containing possible pathologies, such as X-ray images, 
computerized axial tomography (CAT) scans, positron emission tomography (PET) scans, magnetic reso- 
nance images (MRIs), and sonogram images. In addition, the invention may be applied to detect defects in 
various industrial images, such as radiographs or sonogram images of parts, equipment, and/or welds. It is 
noted that, as used herein, the term "anomaly(ies)" is meant to encompass within it the terms "pathology- 

45 (ies)" and "defector. 

As a first step the analog image 2 is raster-scanned and digitized by a digitizing apparatus 4, such as a 
scanning laser digitizer 4. The resultant digitized output image 2* from the digitizer 4 comprises a matrix of 
M rows by N columns (M x N) of adjacent individual picture elements ("pixels"). In a typical medical 
diagnostic application of the invention the digitized image comprises a matrix on the order of four thousand 

so by four thousand five hundred (4000 x 4500) pixels. Each element in the M x N matrix is an optical density 
value. 

Scanned pixel sizes generally range in size from ten micrometers (10um) to two hundred micrometers 
(200um). For medical images, such as a mammogram, the scanning aperture of the digitizer 4 is chosen to 
produce pixels on the order of thirty micrometers (30um) to seventy micrometers (70um), and more 
55 preferably, fifty micrometers (50um). 

In medical or industrial applications where an image 2* may pre-exist in a digital format, as in the case 
of some digital modalities such as PET scans, CAT scans, MRIs and sonograms, scanning to digitize the 
image is not required. 
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The digitized image 2' from the digitizer 4 is submitted for image analysis to a digital computer 10 
operating in accordance with this invention. The digitized image 2* may be stored temporarily in a memory 
buffer in the digitizer 4 and then transferred to the memory of the computer 10 or may be immediately 
transferred and stored in the memory of computer 10. The digitized image 2' is stored as an M x N matrix 

5 of optical density values representing the sampled adjacent pixel locations in the image 2'. 

A more detailed block diagram of the analysis steps executed within the computer 10 is shown in 
Figure 2. For analysis purposes the digitized image 2* from the digitizer 4 is subdivided into a predeter- 
mined number of regions, each of which contains a matrix of m rows by n columns (m x n), where m < M 
and n < N. In the preferred embodiment, the matrix defining each region of the image is on the order of 

io sixty four by sixty four (64 x 64) pixels. 

In the preferred case, as is suggested in Figure 2, each matrix representing a region of the overall 
digitized image 2' may be envisioned as being generated as the result of the raster scan of an m x n (64 x 
64) pixel "window" 12 moving across the digitized image 2\ Starting from the upper left hand corner of the 
image 2* the window 12 may be horizontally incremented across the image 2* by any convenient 

15 predetermined number of pixels. Preferably, the window 12 is incremented one pixel at a time. Similarly, at 
the end of each horizontal scan, the window 12 is vertically incremented by any convenient predetermined 
number of pixels (again, preferably, one pixel at a time). 

In some instances, to minimize the digital computational requirements and the analysis time required 
for interpretation, the original digitized image is subjected to segmentation analysis to identify the image 

20 within the window. The output of this process is the set of pixel coordinates defining the boundary of the 
image which determines the region of interest for further image processing. Raster scanning is then 
performed only over the region of interest defined by the segmentation analysis process. 

Each subdivided region of the image 2' as defined by the window 12 is preprocessed in preparation for 
analysis, as generally indicated by the block 14. In one aspect image preprocessing 14 includes 

25 background subtraction, as indicated at 16. Another aspect of image preprocessing is a two-stage 
normalization, as indicated at 1 8. 

After image preprocessing, image analysis is performed using a parallel distributed processing system 
("neural network system") 20 comprised of a plurality of at least two, or more, trained member parallel 
distributed processing networks 22 ("neural networks"). 

30 Background subtraction is performed to narrow the range of optical density information that must be 
analyzed. As seen in Figure 2, one region of the image 2* as defined by the matrix window 12 is shown to 
contain an anomaly A. Figure 3A is a plot of the optical density values versus distance along the line Z-Z 
extending through the window with the anomaly A therein. Sections 50 and 54 of the plot of Figure 3A show 
a gradual decrease in optical density readings across the window 12. Section 52 of the plot represents a 

35 significant change in optical density due to the presence of the anomaly A. After background subtraction, as 
seen in Figure 3B, sections 50' and 54' respectively represent the leading and trailing sections 50, 54 of the 
original plot (Figure 3A), while section 52' represents the optical density difference of the anomaly A. 

Any one of a number of ways of background subtraction from a data sample known in the art may be 
used. For example, the "least squares" approach may be used. 

40 For each pixel position in the region defined by the window 12 a plane is fitted by a least-squares 
algorithm to all of the pixels within the window: 

Dg = ao + ai i + a 2 j 

45 where D:barg is the estimated pixel value at the position (i,j) in the sixty-four by sixty-four pixel window 12 
and the coefficients (ao, ai, a 2 ) are obtained by least-squares fitting. The estimated planar fit is then 
subtracted from the original pixel value in the upper left hand pixel closest to the window center: 

AD M = - D u 

50 

where Dg is the original pixel value at (i,j). The resultant pixel value (optical density difference. ADg) is then 
stored in a secondary image at its respective coordinates. 

Through use of the least squares plane the background may be subtracted from the original optical 
density values of matrix window 1 2. 
55 In the preferred embodiment, the area of the anomaly A should be significantly smaller than the area of 
the region of the image 2' defined by the window 12. For example, a window 12 may be sized so that the 
largest pathology or defect examined would be perhaps a factor of ten times smaller than the window 12. In 
such a case most of the pixels are associated with the background. Thus, in effect, a high-pass spatial 
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frequency filtering operation is performed. When the least squares plane is subtracted, the optical density 
values in the region of interest increase in proportion to the background optical density values and, thereby, 
the signal to noise ratio is improved (Figure 3B). 

Following background subtraction a subregion 12* of p x q pixels, where p < m and q < n, is selected 

5 from within the m x n region 12, preferably from the center thereof. In the preferred case subregion 12' is 
on the order of sixteen by sixteen pixels. As indicated at 18 in Figure 2, a two-stage normalization operation 
is then performed on the pixels in the subregion 12*. 

The first normalization stage is a scale invariance transformation 18A. When imaged, a specific anomaly 
will exhibit quantitatively different spatial optical density profiles depending on different exposure conditions, 

ro film sensitivities, and film sensitometries. To reduce the computational classification requirements for 
pattern recognition of suspect anomalies, a normalization procedure is required for identifying anomalies of 
the same type, regardless of image capture conditions. In addition, even if the exposure conditions, film 
sensitivities, and film sensitometries are the same, there are classes of anomalies that exhibit different 
optical density profiles, which, after undergoing a linear transformation, will exhibit the same normalized 

75 spatial profile. Again it is advantageous to consider such anomalies as belonging to the same class. 

A straightforward method for performing this classification normalization is by transforming the vector 
quantized image onto a unit sphere in n-dimensional pixel space for each p x q window to be analyzed for 
the presence of an anomaly. A specific anomaly shape class is then recognized by its phase angle in this 
normalized n-dimensional space. 

20 The high-pass filter image undergoes a second raster scan using the smaller p x q window 12*. Using a 
fifty micrometer (50um) pixel scanning aperture diameter this smaller (16 x 16) pixel window 12* is sufficient 
to allow anomalies of approximately fifty to two hundred fifty micrometers (50-250um) diameter to be 
recognized. The upper left hand pixel nearest the center of the window is again tagged. All of the pixels 
within this window are assumed to form an orthogonal vector space of two hundred fifty six (256) 

25 dimensions, where each dimension represents the range of possible preprocessed optical densities (AD,j 
values) for its respective pixel. All pixel values within the window are linearly transformed to fit the 
normalized range of zero to one: 

Z (ADjj)2 = 1 ; 9i,j = cos-1 ADjj 

ij=l 

35 

where Sij is the direction arccosine of the normalized pixel value at (i.j). The spatial shape of an object 
within the original window is now encoded as a phase angle vector, 9, within this 256-dimensional vector 
space and the tip of each vector representing an object lies on a unit sphere. 

The second normalization stage 18B is a dynamic range normalization, wherein the set of transformed 

40 optical density difference values of the subregion 12' are normalized to fit the input dynamic range of each 
member neural network 22 in the neural network system 20. For example, if the functions employed in any 
hidden and output layers of the member neural networks yield values within the ranges of ~1.0 to + 1.0, the 
values at the input layer should be normalized to fit within this range. 

It should be noted that both the background subtraction 16 (performed on the region 12) and the two- 

45 stage normalization (18 performed on the subregion 12') are local to each predetermined region or 
subregion, as the case may be. Since background subtraction 16 and two-stage normalization 18 are 
performed locally, small changes within each region or subregion are not masked or hidden by the overall 
noise characteristics of the image 2*. 

The actual anomaly recognition is performed using the neural network system 20 comprised of a 

so plurality of at least two trained member neural networks 22A through 22N. Each member neural network 
22A through 22 N is, in the preferred instance, a feedforward neural network having at least three layers, an 
input layer 23I, an intermediate or "hidden" layer 23H, and an output layer 23L. Each node in each layer is 
connected through connection weights to every node in an adjoining layer. A three-layer neural network is 
the minimum network topology required to obtain an arbitrary nonlinear mapping of input to output. Each 

55 input and hidden layer 241 and 24H, respectively, in each member neural network 22A through 22N is 
comprised of a plurality of nodes. Each output layer 24L in each member neural network 22A through 22 N 
is comprised of one or more nodes. Each member neural network 22A through 22N should be fully 
connected. 
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Each node, or processing element, in the input layer has, as its input, a stored digital value representing 
the preprocessed values of each element in the subregion 12'. Each node in each succeeding layer has as 
its input the signal present at the output of the nodes of an adjoining layer, as multiplied by their associated 
connection weights. Each node has a constant bias input, or offset, value which is multiplied by its own 
5 weighting factor. The inputs to each node are summed and scaled by a nonlinear transformation to produce 
an output signal value. In the preferred case the hyperbolic tangent function, whose output range is "1 to 
+ 1, is employed as the nonlinear function. The topology, structure, and organization of a three-layer neural 
network is fully discussed in United States Patent 5,046,020 (Rlkin), which is hereby incorporated by 
reference. 

10 Each member neural network 22A through 22N is trained to recognize a particular anomaly type and/or 
size. For example, in the analysis of mammograms, a subsystem comprising one or more member neural 
network(s) 22 detects a first anomaly type, such as a microcalcification, while a second subsystem 
comprising one or more other member neural network(s) may be trained to recognize a second anomaly 
type, such as a lesion. Furthermore, each subsystem of member neural networks 22 may detect a given 

75 type of anomaly whose size falls within particular predetermined size ranges, e. g. fifty (50um) to one 
hundred micrometers (100um), one hundred micrometers <100um) to one hundred fifty micrometers 
(150um), etc. Such a grouping of member neural networks to detect various sizes of an anomaly of a given 
type provides a measure of redundancy in detecting that particular anomaly type and, thereby, adds 
significantly to the accuracy of the results. 

20 Training may be accomplished using the back propagation paradigm as discussed in "Parallel 
Distributed Processing", D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Volume 1, Chapter 8, "Learning 
Internal Representations by Error Propagation", Bradford Books, 1986. The software package ANSim, 
version 2.30 (© 1988, Science Applications International Corporation, San Diego, CA), may be employed for 
this purpose. Alternately, the training method disclosed in United States Patent 5,046,020 (Fiikin), assigned 

25 to the assignee of the present invention, wherein the connection weights are generated using stiff differential 
equations may be used. 

As is well understood by those skilled in the art, training of each member neural network uses a 
"training set" containing an adequate sampling of digitized images of regions of analog images that either 
manifest an anomaly of interest or that are anomaly-free. A record is kept correlating the digitized sample 

30 images to their location in the original analog images. In addition, in the preferred embodiment, a value is 
stored with each training sample to indicate whether an anomaly is present (e.g. + 1 .0) or to indicate that a 
window is anomaly-free (e. g. -1 .0). 

As the neural network is interpolative in nature, its level of accuracy when in ultimate use depends in 
large part on whether the data used in training the member neural networks is a statistically significant 

35 representative sampling. Some of the factors to consider in selecting a statistically significant number of 
samples to adequately train a member neural network to identify a particular anomaly include the type of 
anomaly, the variation in characteristics of the anomaly, the size of the neural network (number of weights, 
number of layers, number of nodes), and the noise in the analog images. However chosen, the number of 
samples in a training set must be a statistically significant number of samples having a particular 

40 characteristic (e.g. size, shape, intensity profile, etc.) to present to a neural network to allow it be adequately 
trained. "Adequate training" means trained to recognize an anomaly within a user-defined accuracy. Each 
neural network is separately trained to identify a predetermined size range of a predetermined type of 
anomaly, with all other member neural networks de-activated, so that a given member network will respond 
most strongly only when the characteristics that it has elicited from its training set are present in an image 

45 outside the training set. 

The iterative process through which initial weight values are incrementally changed until the error in the 
output of the network ceases to be reduced significantly (i. e., a local minimum is reached) is discussed in 
the above-referenced United States Patent 5,046,020 (Fiikin) and need not be repeated here. 

Once a member neural network is trained, testing of each member neural network must be undertaken 

so to determine if the local minimum produced through training will satisfy desired accuracy requirements for 
anomaly identification. Testing will determine if further training of the various member neural networks is 
required. 

As indicated in Figure 2 at block 24, the output signal values of each member neural network is 
compared to a first predetermined anomaly threshold value above which the presence of a possible 
55 anomaly of the given type is indicated. Also, a user may define separate anomaly threshold values for each 
member neural network. Alternatively, a single threshold value may be defined to which the outputs of all 
member neural networks are compared. 



6 




EP 0 610 805 A2 



Those output signal values exceeding the first threshold value(s) are compared to each other by a 
winner-take-all (WTA) algorithm 26 to select the maximum signal value. The output value selected by the 
WTA algorithm 26 is compared to a second predetermined threshold, above which the presence of a 
possible anomaly of the given type is also indicated. The comparison to the second threshold is shown in 
5 Figure 2 at block 28. Depending upon the magnitude of the second threshold value the output of the 
thresholded WTA 26 may detect a fraction of image features which do not actually represent anomalies 
("false positives") or may fail to detect a fraction of image features which do actually represent anomalies 
("false negatives"). The second threshold must be larger than or equal to the minimum value of the first 
threshold(s). 

w In assigning either of the thresholds 24 or 28 it should be noted that the lower its value, the higher is 
the number of false positives. Conversely, the higher the threshold values the greater the possibility of 
yielding false negatives. For example, given an output range of -1 .0 to +1 .0. a user may set the threshold 
for detecting a particular anomaly at +0.3. If the single value from the WTA algorithm 26 exceeds +0.3, a 
particular anomaly type may be present within subregion 12', e. g., 95% of the time. That is, in 95% of 

15 cases a physician's diagnosis will agree that the neural network detected a particular pathology. If greater 
accuracy is needed, the threshold level may be adjusted higher, e.g. to +0.4, during network training to 
achieve, say, 98% accuracy. However, the false negative rate will also increase. That is, a number of actual 
pathologies having presented output signal values between +0.3 and +0.4 will now remain undetected at 
the higher threshold value. 

20 As indicated at block 30 in Figure 2 if an anomaly is detected the location within the subregion 12' 
containing the anomaly is recorded for future analysis. In the preferred embodiment, the location of the 
subregion 12' is defined by the coordinates of the pixel in the upper left hand corner. 

Each scan of the regions of the image defined by the stepwise raster scanning of the window 12 is 
analyzed in the manner discussed until either the entire digital image or the region of interest defined by 

25 the segmentation process has been examined. After the entire image has been thus analyzed by the neural 
network system, the coordinates of all detected possible anomalies are transmitted to block 32 for further 
analysis. For instance, a two-stage clustering analysis algorithm may be applied which detects the presence 
of a predetermined number (e. g., more than three) of possible anomalies within a predetermined radius. 
The two-stages of the clustering analysis include microclustering binding and macroclustering binding. 

30 Microclustering binding of the thresholded image is required to remove multiple counts of the same 
anomaly. Since the nonlinear convolution transform in the second scan involved the p x q (16 x 16) window 
12', microclustering requires a third image raster scan with the same window size. All locations in which an 
anomaly occurs within the window are interpreted as belonging to the same anomaly. A final map is then 
generated, representing the location of all recognized individual anomalies in the image. 

35 The interpretive significance of anomalies within the image 2 depends on many factors, but one of 
primary significance is related to the number of anomalies within an area of specified size. Macroclustering 
accepts the possibility of an anomaly when at least four positive events are detected within a diameter of 
approximately one centimeter. This filtering operation eliminates single anomalous events due to highly 
localized film defects and questionable interpretations of events associated with the detection of anomaly 

40 sizes at the minimum spatial resolution of the film digitizing system. 

The final scan of the image with this macroclustering criterion generates the final map representing 
regions of interest in which possible anomalies have been detected. Due to the stochastic distribution of 
anomaly occurrences within a specific image 2 the boundary lines enclosing these regions of interest can 
be irregular and can extend beyond the one centimeter diameter threshold. 

45 With reference again to Figure 1 , if at least one cluster of anomalies is detected a signal is sent to a 
printer 36. The printer 36 produces a transparency overlay sheet 38 containing at least one marker 40 
which, when superimposed upon the original image 2, calls a viewer's attention to the location of possible 
anomalies. In the preferred embodiment the marker 40 takes the form of a closed contour centered on the 
location of an anomaly. The positioning of the closed contour may be accomplished by a conventional 

50 center of mass technique. 

If the analog image is digitized from a video signal, markers may be placed on an overlay transparency 
or a superimposable graphics layer to indicate the positions of suspected abnormalities on the video 
monitor displaying the original video image. 

However, if the user is unable to achieve specified percentages of true positives and false positives 

55 from iterative training of the neural network, it may be necessary to select different initial weights, e.g. by 
choosing a new random weight vector. One might also increase the number of hidden layers to reduce the 
error percentage or increase the number of nodes in the hidden layer to further reduce the error. 
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An adequately trained neural network comprises a set of trained weights which meet the specified 
requirements for the incidence of false positives and false negatives. 

Those skilled in the art, having the benefit of the teachings of the present invention as hereinbefore set 
forth, may effect numerous modifications thereto. Such modifications are to be construed as lying within the 
5 contemplation of the present invention, as defined by the appended claims. 

Claims 

1. A method for computer-aided detection of possible anomalies in a digitized image comprising a 
10 plurality of M x N picture elements each representing an optical density in the digitized image, the 

method comprising the steps of: 

a) subdividing the digitized image into a plurality of predetermined regions each comprising m x n 
picture elements, where m < M and n < N; 

b) subtracting background from each predetermined region of the digitized image; 
75 c) selecting a subregion comprising pxq picture elements, where p < m and q < n; 

d) normalizing the image data from the p x q subregion; 

e) using a neural network system, analyzing each predetermined subregion of the digitized image to 
recognize any pattern indicative of an occurrence of a possible anomaly; 

the neural network system comprising at least two member neural networks each trained to 
20 recognize a particular predetermined anomaly type within a predetermined size range and to 

produce an output signal value indicative of the presence of said predetermined anomaly type; 

f) comparing each of the output values of each of the member neural networks to a first 
predetermined threshold value corresponding to each member neural network above which the 
presence of a possible anomaly is indicated; 

25 g) comparing each output value that exceeds each first predetermined threshold(s) to each of the 

other output values that exceed each corresponding first predetermined threshold to select the 
maximum signal value; 

h) comparing the maximum signal value to a second predetermined threshold value above which the 
presence of a possible anomaly is indicated; 
30 i) based upon the comparisons of step h), determining the location within the digitized image of each 

possible anomaly; 

j) using a clustering analysis on the locations of possible anomalies of step i), identifying each 
vicinity on the digitized image at which a cluster of locations of possible anomalies occurs; and, 
k) creating a marker for each cluster, having a contour that surrounds all of the anomalies in each 
35 cluster, corresponding to the digitized location of each cluster. 

2. The method of claim 1 wherein the step k) itself comprises the step of placing the market on a 
transparency overlay sheet superimposable upon the original image. 

40 3. The method of claim 1 wherein each subregion of p x q picture elements is selected from the center of 
each m x n region. 

4. The method of claim 1 wherein the step b) comprises using a high pass spatial frequency filter. 

45 5. The method of claim 1 wherein the step d) comprises using a two-stage normalization, the first stage 
being a scale invariance. and the second stage being a dynamic range normalization. 

6. The method of claim 1 wherein the step j) comprises using a two-stage clustering algorithm, the first 
stage being a microclustering binding step, and the second stage being a macroclustering binding step. 

50 

7. The method of claim 1 wherein each first threshold value of step f) is different for each member neural 
network. 

8. The method of claim 1 wherein each first threshold value of step f) is the same for each member neural 
55 network. 
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